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Abstract 

Reconstruction problems have been studied in a number of contexts including biology, information 
theory and and statistical physics. We consider the reconstruction problem for random fc-colourings 
on the A-ary tree for large k. Bhatnagar et. al. [2] showed non-reconstruction when A < i/clogfc — 
o{k log k). We tighten this result and show non-reconstruction when A < k [log k + log log fc + 1 — In 2 — 
o(l)] which is very close to the known upper bound on the number of colours needed for reconstruction, 
A > fc[logfc + loglogfc+ 1 + o(l)]. 

1 Introduction 

Determining the reconstruction threshold of a Markov random field has been of interest in a number of 
areas including biology, information theory and and statistical physics. Reconstruction thresholds on trees 
are believed to determine the dynamic phase transitions in many constraint satisfaction problems including 
random K-SAT and random colourings on random graphs. It is thought that at this poin the space of solutions 
splits into exponentially many clusters. The properties of the space of solutions of these problems are of 
interest to physicists, probabilists and theoretical computer scientists. 

The upper bound on the number of colours required for reconstruction of k [log k + log log A; + l + o(l)] > A 
was known from the work of fT9l and |[2T1 and is given by the analysis of a naive reconstruction algorithm 
which reconstructs the root only when it is known with absolute certainty given the leaves. The problem of 
finding good lower bounds on the number of colours required for reconstruction is more difficult, it requires 
showing that the spins on the root and the leaves are asymptotically independent. The best previous rigorous 
results for a lower bound on the number of colours needed for reconstruction was A < ^A; log k — o{k log k) 
in m. We improve this to A < /c[log k + log log /c + 1 — In 2 — o(l)]. Even at a heuristic level no lower 
bound as good as ours was known. 



1.1 Definitions 

We begin by giving a general description of broadcast models on trees and the reconstruction problem. The 
broadcast model on a tree T is a model in which information is sent from the root p across the edges, which 
act as noisy channels, to the leaves of T. For some given finite set of characters C a configuration on T is an 
element of C^, that is an assignment of a character C to each vertex. The broadcast model is a probability 
distribution on configurations defined as follows. Some |C| x |C| probability transition matrix M is chosen 
as the noisy channel on each edge. The spin cxp is chosen from C according to some initial distribution and 
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is then is propagated along the edges of the tree according to the transition matrix M. That is if vertex u is 
the pai^ent of v in the tree then spin at v is defined according to the probabilities 

P{(yv = jWu = i) = Mij. 

We will focus on the colouring model with \C\ = k which is given by the transition matrix Mj = 
Broadcast models and in particular colourings can also be considered as Gibbs measures on trees. Given a 
finite set of colours k and a graph T = {V, E) a /c-colouring is an assignment of a colour to each vertex so 
that adjacent vertices have different colours. The random A;-coluring model is then the uniform probability 
distribution on valid /c-colourings of the graph. It is a Gibbs measure or Markov random field on the space 
of configurations a G {1, . . . , /c}^ given by 

^(^)=| n w<x„ 

{u,v)&E 

where Z is a normalizing constant given by the number of colourings of T. On an infinite tree more than 
one Gibbs measure may exist, the broadcast colouring model corresponds to the free Gibbs measure. 
We will restrict our attention to A-ary trees, that is the infinite rooted tree where every vertex has A offspring. 
Let L{n) denote the spins at distance n from the root and let U{n) denote L{n) conditioned on Up = i. 

Definition 1 We say that a model is reconstructible on a tree T if for some i,j G C, 

limsupdTv{L''{n),L^{n)) > 

n 

where dxv the total variation distance. When the limsup is we will say the model has non-reconstruction 
on T. 

Non-reconstruction is equivalent to the mutual information between ap = L(0) and L{n) going to as n 
goes to infinity and also to {L(n)}^^ having a non-trivial tail sigma-field. More equivalent formulations 
are given in |18] Proposition 2.1. 

In contrast consider the uniqueness property of a model. 
Definition 2 We say that a model has uniqueness on a tree T if 

limsup sup dTv{P{(^p = ■|^(^) = L),P{(Tp = ■\L{n) = L')) > 

n L,L' 

where the supremum is over all configurations L,L' on the vertices at distance nfrom the root. 

Reconstruction implies non-uniqueness and is a strictly stronger condition. Essentially uniqueness says that 
there is some configuration on the leaves which provides information on the root while reconstruction says 
that a typical configuration on the leaves provides information on the root. 

1.2 Background 

For some parameterized collection of models the key question in studying reconstruction is finding which 
models have reconstruction, which typically involves finding a threshold. This problem naturally arises in 
biology, information theory and statistical physics and involves the trade off between increasing numbers of 
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leaves with increasingly noisy information as the distance from the root to the leaves increases. The simplest 
collection of model is the binary symmetric channel which is defined on two characters with 



for < e < 2 which corresponds to the ferromagnetic Ising model on the tree with no external field. It was 
shown in 131 and HI that this channel has reconstruction if and only if A(l — 2e)^ > 1. 
The broadcast model is a natural model for the evolution of characters of DNA. In phylogenetic reconstruc- 
tion the goal is to reconstruct the ancestry tree of a collection of species given their genetic data. Daskalakis, 
Mossel and Roch 1(171 161 proved the conjecture of Mike Steel that the number of samples required for phylo- 
genetic reconstruction undergoes a phase transition at the reconstruction threshold for the binary symmetric 
channel. 

Exact reconstruction thresholds have only been calculated in the binary symmetric model and binary asym- 
metric models with sufficiently small asymmetry lH. In both these cases the threshold corresponds to the 
Kesten-Stigam bound ifTTl . The Kesten-Stigam bound shows that reconstruction holds whenever AA2(M)^ > 
1 where A2(M) denotes the second largest eigenvalue of M. In fact when AA2(M)^ > 1 it is possible to 
asymptotically reconstruct the root from just knowing the number of times each character appears on the 
leaves (census reconsti"uction) without using the information on their positions on the leaves. Mossel llT6llT8l 
showed that the Kesten-Stigam bound is not the bound for reconstruction in the binary-asymmetric model 
with sufficiently large asymmetry or in the Potts model with sufficiently many characters. 
It was shown by fTO\ that fc-colourings have uniqueness on A-ary trees if and only if A; > A + 2 which is 
a lower bound on the reconstruction threshold. Exactly finding the threshold for reconstruction is difficult 
so most attention has been focused on finding its asymptotics as the number of colours and the degree 
goes to in infinity. Recently El greatly improved this bound showing that non-reconstruction holds when 
k > CA/ log A (or equivalently when A < C~^fc log k) for C > 2. On the other hand |fT9| showed that 
when A > A; log k + o{k log k) then with high probability in k the spin of the root is exactly determined by 
the leaves and so reconstruction is possible. With a more detailed analysis this argument can be improved 
to show reconstruction when k[log k + log log A; + 1 + o(l)] > A, as was shown in |[2T1 . 
This is a large improvement on the Kesten-Stigam bound which implies reconstruction when A > (fc — 
1)^. In related work Mezard and Montanari jlSJ found a variational principle which establishes bounds 
on reconstruction for colourings. However, except in the case of A: = 5 this does not improve upon the 
bound from Lemma|7l see 112111221 . Our results establish far tighter bounds with the upper and lower bounds 
differing by just k log 2 rather than | A; log k. 

Theorem 1 The k-colouring model on the lS.-ary tree has reconstruction when 

A > A;[log A; + loglog A; + 1 + o(l)]. 

and non-reconstruction when 



1.3 Applications to Statistical Physics 

The reconstruction threshold on trees is believed to play a critical role in the dynamic phase transitions 
in certain glassy systems given by random constraint satisfaction problems such as random K-SAT and 
random colourings on random graphs. We will briefly describe what is conjectured by physicists about such 




A < A:[logA: + loglogA; + 1 -ln(2) - o(l)]. 
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systems |[T2ll22l . generally without rigorous proof, and why understanding the reconstruction threshold for 
colourings plays an important role in such systems. 

The Erdos-Renyi graph G{n,p) is a random graph on n vertices where every pair of vertices is connected 
with probability p. To maintain constant average degree A we let p = A/n. The A;-colouring model on 
G(n, A/n) or random A-regular graphs undergoes several phase transitions as A grows. If we consider 
the space of solutions to a random colouring model where two colourings are adjacent if they differ at a 
single vertex then for the smallest values of A the space of solutions forms a large connected component. 
Above the clustering transition A^ the space of solutions breaks into exponentially many disconnected 
clusters and has no giant component with a constant fraction of the probability. This replica symmetry 
breaking transition is believed to occur at A^ = k[logk + log log A; + a + o(l)] with differing values of 
a conjectured: a = 1 — In 2 in |[T3l and q = 1 in lfT2l . In a recent remarkable result [H rigorously 
proved that when (1 + o(l))fe log k < A < (2 — o(l))fc log k then the space of solutions indeed breaks into 
exponentially many small clusters. A second transition occurs when most clusters have frozen spins, that is 
vertices which have the same colour in every colouring in the cluster. This phase transition is believed to 
occur at A^ = A;[log k + log log k + l + o(l)] 12111221 and is the best upper bound known for Ad- Two more 
transitions are believed to occur, condensation where the size of the clusters takes on a Poisson-Dirichlet 
process and the colouring threshold beyond which no more colourings are possible, aie conjectured to occur 
at Ac = 2k log fc - log A; - 2 log 2 + o(l) and A^ = 2A; log A; - log A; - 1 + o(l) respectively lEH. Similar 
results are also expected to hold for K-SAT and other random constraint satisfiability problems llT2l . 
Both random regular and Erdos-Renyi random graphs are locally tree-like. Asymptotically in a random 
regular graph the neigbourhood of a random vertex is a random regular tree and for Erdos-Renyi random 
graphs it is a Galton- Watson branching process tree with Poisson offspring distribution. It is conjectured 
|[T2l that the reconstruction threshold on the corresponding tree is exactly the clustering threshold Ad on 
the random graph. As such rigorous estimates of the reconstruction problem can be seen as part of a larger 
program of understanding glassy systems from constraint satisfaction problems. 

The clustering threshold is also believed to play an important role in the efficiency of MCMC algorithms for 
finding and sampling from colourings of the graphs. MCMC algorithms are believed to be efficient up to the 
clustering threshold but experience an exponential slowdown beyond it |[T2l . This is to be expected since a 
local MCMC algorithm can not move between clusters each of which has exponentially small probability. 
Rigorous proofs of rapid mixing of MCMC algorithms, such as the Glauber dynamics, fall a long way 
behind. For random regular graphs, results of [7] imply rapid mixing when k > 1.49A, well below the 
reconstruction threshold and even the uniqueness threshold. Even less is known for Erdos-Renyi random 
graphs as almost all MCMC results are given in terms of the maximum degree which in this case grows with 
n. Polynomial time mixing of the Glauber dynamics has been shown ll20l for a constant number of colours 
in terms of A. 

1.4 Open Problem 

If the probability that the leaves uniquely determine the spin at the root does not go to as n goes to infinity 
then the model has reconstruction. It is natural to ask is this a necessary condition for reconstruction. When 
k = 5 and A = 14 it was shown in |[T5l using a variational principle that reconstruction holds but the 
probability that the leaves fix the root goes to 0. However, this is the only case in which the variational 
principle gives an upper bound on the number of colours required for reconstruction which is better than the 
bound of the leaves fixing the root. It remains open to determine if for large numbers of colours/degrees if 
this is exactly the reconstruction threshold. Numerical results of [}22l suggest this is in fact not the case and 
there are two separate thresholds. Answering this question would be of significant interest. 
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2 Proofs 



We introduce the notation we use in the proofs. We denote the colours by C = {1, . . . ,k} and let T be the 
A-ary tree rooted at p. Let ui, . . . , ua be the children of p and let Tj denote the subtree of descendants of 
Uj. Let P(o-) denote the free measure on colourings on the A-ary tree. Let L{n) denote the spins at distance 
n from p and let Lj{n) denote the spins on level n in the subtree Tj. Let U{n) and L'j{n) respectively 
denote and Lj{n) conditioned on dp = i. For a boundary condition L on the spins at distance n from 
p define the deterministic function /„ as 

fn{i,L)=P{a, = i\L{n)=L). 

By the recursive nature of the tree we also have that 

fnii,L) = P{auj = i\Lj{n) = L). 

Now define Xi (n) = Xi by 

Xi{n)= Ui,L{n)). 

These random variables are a deterministic function of the random configuration L{n) of the leaves which 
gives the marginal probability that the root is in state i. By symmetry the Xi are exchangable. Now we 
define two distributions 

X+ =X+{n)=U{l,L\n)) 

and 

X~ =X-{n) = fn{l,L\n)) 

which are different distributions because L^{n) and L'^{n) are different distributions of configurations on the 
leaves. We will establish non-reconstruction by showing that X^ and X~ both converge to ^ in probability 

k' 



as n goes to infinity. Let Xn and Zn denote EX^{n) and E{X~^{n) — i)^ respectively. By symmetry we 



have 

I X otherwise, 

and the set {fn{i, L^{n)) : 2 < i < k} is exchangeable. Moreover they are conditionally exchangeable 
given fn{l, L^{n)). Now define 

Yij=Yij{n)=fn{i,L]{n)). 

This is equal to the probability that cr^^ = i given the random configuration Lj{n) on the spins on level n in 
the subtree Tj . As the configurations on the subtrees are conditionally independent given the spin at the root 
we have that the random vectors Yj = {Yij, . . . , Ykj} are conditionally independent. Further given aj and 
Y^^j the random variables {Yij : i aj} are conditionally exchangeable. We make use of these symmetries 
to simplify the anaylsis. Given the standard Gibbs measure recursions on trees we have that 

/„+i(l,L(n+ 1)) - 



Ef=inf=i(l-/n(i,i,(n))) 
and so 

X+{n+l) = ^^ 
Ell 

where 

A 

Z, = J{{l-Y,,). 
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The following lemma, which can be viewed as the analogue of Lemma 1 of Q, allows us to relate the first 
and second moments of X~^. 

Lemma 1 We have that 

k k 
xn = EX+ = E^P{ap = i\L\n))^ = Ej2iMn)f- 

1=1 i=l 

and 



x^-\ = EX+ -\ = E Y^iMn) - If > E{X+ - = z„. 



Proof: From the definition of conditional probabilities and of fn and the fact that P{ap = 1) = ^ we have 
that 

Efnil, L\n)) = fn{l, L)P{L{n) = L\cjp = 1) 

L 

_^ P(L(n) = L,ap = l) ^ 
= kY,P{L{n) = L)Ul,Lf 

L 

= kE{Xi{n)f 

k 

= EY,iMn)? 



i=l 



and 



k k k 

eY,{XM) - \f = EY,{X,{n)f - IeY^XM) + 4 = " \- 



k ^ ' k-^ 
1=1 1=1 



Corollary 1 We have that Xn > ^ and that 

lim Xr, 

n 

implies non-reconstruction. 

Proof: We have that x„ > z„ + ^ > p If x„ converges to \ then 



1 

k' 



fe / . X 2 

Y.^E[xln)--^ ^0 



which implies non-reconstruction. 
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2.1 Non-reconstruction 



Our analysis is split into two phases, the first when Xn is close to 1 and the second when Xn is close to ^ . 



Lemma 2 Suppose that (3 <1 — log 2. Then for sufficiently large k if IS. < k\[og k + log log k + P] then 

2 



limsupx„ < 



k 



Proof: We fix the colour of the root to be 1 let JT denote the sigma-algebra generated by {a^ : 1 < j < A} 
the colours of the neighbours of the root. For 2 < i < /c let 6/ = #{j : = i}, the number of times each 
colour appears amongst the neighbours of the root. For 1 < i < k define 



n 



l<i<A:<T„^.=j 



We will use the symmetries and exchangeability of the model to reduce the problem to considering a random 
variable only involving the C/j. Conditional on T, the Ui are. independent and are distributed as the product 
of bi independent copies of X~^{n) and < Ui < 1 for all i. Fix an £ with 2 < £ < k. Let Wi and W£ be 
defined by 



l<j<A:au^^£ 



l<i<A:a„^.^£ 



SO Zi = WiUi. Define 
and 

and for i {1, £}, 



WiUi, 



l<i<A:<T„^- = 

Zi, = Zi. 



(1-^1 



As we noted earlier Yj = {Yij, . . . , Ifcj} are conditionally independent given T and for each j given Oj and 
Yojj the random variables {Yij -.i^Gj] ai^e conditionally exchangeable. It follows that 

(T^i, VF,, Zi, . . . , Zfc, C/i . . . , [/fc, ai, . . . , c7a) 



We,Wi,Zi, ...,Zk,Ui...,Uk,ai,...,aA] ■ 



(1) 

where we denote equality as in distributions of random vectors. Since {Wi + Y17=2 ^i) ~ + X^r=2 ^i) ~ 
{Wi — W£){1 — Ue) has the same sign as Wi — We then ^r^j^j-n — ^ — ^ _^^n — j- has the opposite sign as 
Wi — Wi. Applying the equality in distribution of equation ^ we have that 



E 



■1e 

2 



1 



We-Wi 



Wl + E1=2Z^ 

Wi-Wi 



Wi 



+ 



Wt 



-E 
2 

> 



Wl+Y:^=2Z^ W, + Y:=2Z^ 

[Wi -Wi)' ^ 



1 



Wl + Y.l=2Z^ w,^y:Uz^ 



{Ui\ 
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where the first equaUty follows using equality in distributions of the random vectors and the inequality 
follows from the two terms of the product having the same sign. Since < Zi < Wi < 1 we have that, 



E 



^, {U^] 



< E 

< E 

< E 



Wi 



^1 + YJi=2 Zi 



and so 



and hence 



E 



i=\ 



E{X'^{n + \)\T,{\Ji}\ =E 



Zi 



y:uz. 



< 



1 + Y.U 



Now using the fact that = jj^ s^ds we have that 

1 



1 + Er=2 

As is convex as a function of u we have that s" < 5*^(1 — n) + s^u when < n < 1 and so since 
< C/j < 1 we have that Es^^ < (1 - EUi) + sEUi = 1 - (1 - s)EUi. Since conditional on T the Ui are 
independent and are distributed as the product of hi independent copies of X'^iii) we have that, 

1 ^ 

E{X^(n^\)\T\< I fl{l-{l-s)E[Ui\T])ds 

•^0 i=2 

= f'T\(^-{l-s){l-Xn)'^)ds. 
■^0 i=2 

Now the colours are chosen independently and uniformly from the set {2, . . . , A;} so (62, . . . ,bk) has 
a multinominal distribution. Let /3 < /?* < 1 — log 2 and let 6, be iid random variables distributed as 
Poisson(L') where D = log A; + log log /c + /?*. By Lemma |4] we can couple the 6's and 6's so that 
(62, ■ ■ ■ ,bk) < {h2i • • • , hk) whenever Yl^j=2 — ^- follows that 

Xn+i = EX+{n+l) 
<P + 



<E1,^, / El[{l-{1-S){l-Xn)'^)ds 



1 — s) exp(— ^ ds 



Jo 

<p+ / exp {—{1 — s){k — 1) exp{—XnD)) ds 
Jo 



p + 



1 — exp {—{k — 1) exp{—XnD)) 
{k - l)exY){-XnD) 
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where p = P(Poisson((fc — l)D) < A). Now p = exp(— r2(-^)) = o{k ^) and the function 

1 - exp(-(A; - l)exp(-yi:»)) 

g{y) =p+ — — — 

[k - l)exp{-yD) 

is increasing in y so the result follows by Lemma[3] 
■ 

Lemma 3 Let yQ, yi, ... be a sequence of positive real numbers such that yo = 1 cind Un+i = g{yn) where 

giVn) =P+ '"7i:i);xp('lTfg)"''" ' D = logk + log log k + P*,P* <1- log 2andp = oik^^). Then 
for large enough k, 

2 

limsupxn < — . 

n 



Proof: Since ^ ^"^ 



dx X 



= — i we can find e, 5 > such that when Q < x < 5, then 
< 1 - - - e X. 



X V 2 



Wealsochooser' > r > Osuchthat (| — e)e ^* > e ^(1+r'). Now for large enough /c, exp(—D) 
^^fctog/ < and so 

/I \ (A: - l)e-^* (l + r)e-i e'^ 



2 J k log A; log A; log k 

Now since (7 is a continuous increasing function and yi < yo it follows that the sequence yi is decreasing. 
Suppose that {k — 1) exp(— ?/jZ)) < 6. Then 

yi+i < p + 1 - Q - (A: - 1) exp(-yjD) 

and so 

1 - Vi+i > Q - (A; - 1) exp(-yiL») - p 

f\ \ (k - l)e~^* 
-[2-') klogk - - ^ 

(1 + /)e-i 

> exp((l - yi) log k)-p 

log k 

> {l+r'){l-y,) -p 

> (l+r)(l-2/,) 

where the second last inequality uses the fact that > ex and the final inequality uses the fact that l — yi> 
jlp: while p = o{k~^). It follows that yi decreases until for some i, {k — 1) exp(— 7/^1?) > 5. Now let 

^—j— = a' < a" < a < \ for some a. When k is large enough then 

Vi+i < pH J < a". 
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Then again for k large enough, exp(— > exp(— > exp(— a log k) = k It follows that 

(I. 1^ \ m ^ 

(A; - l)exp(-yi+ii;) 
Finally we have exp(— > exp(— 2A;"~^Z)) > | and so 

y*+3 -P+ 71 — T^ — -f m < ^^"^ 

(A; - l)exp(-yi+2-C>) 

when k is large enough which completes the proof. 
■ 

In the preceding lemma we note that the requirement that P* < 1 — In 2 comes from the fact that x < ^e^~^* 
for all X when /3* < 1 - In 2. 

Lemma 4 Suppose that {bi, . . . , hk) has the multinominal distribution M{n, ^, . . . ^)). Let bj be iid 
random variables distributed as Poisson{D). We can couple the b's andb's so that (bi, . . . ,bk) < {bi, . . . ,bk) 
(respectively >) whenever Yl^=i — (respectively <). 

Proof: Since the bj are independent and Poisson, conditional on the sum N = J2j=i bj, the distribution 
of (6i, . . . , bk) is multinominal M{N, . . . ■^)) (see |[T4l Proposition 6.2.1). Now if n < m then two 
multinomial distributions A and B distributed as M{n, ^, • • • j:)) and M{m, ^, ■ ■ ■ j:)) respectively 
can be trivially coupled so that A < B which completes the proof. ■ 

Janson and Mossel ||9l studied "robust reconstruction", the question of when reconstruction is possible from 
a very noisy copy of the leaves. They found that the threshold for robust reconstruction is exactly the Kesten- 
Stigam bound. Lemma|2]establishes that the leaves provide very little information about the spin at a vertex 
a long distance from the leaves. So as information over long distances is very noisy the results of ||9l suggest 
that reconstruction would only be possible after the Kesten-Stigam bound whereas, in our context, A is 
much less than A2(M)~^. As such only crude bounds are needed to establish the following lemma. 

Lemma 5 For sufficiently large k if A < 2k log k and if Xn ^ f then 

_ 1 If _ 1 
^"+' fc - 2 V^" k 

Proof: Using the identity 

1 1 r 1 

n + 



s + r s s^s + r 

and taking s = E Yli=i and r = Yli=i i^i - EZi) we have that 

1 _ Zi-iEliZ. 



Xn+l - Y = E 



k 



Ell 



et!1.=iZ, {ey!1=^z^^ 
z^-lT.Uz.{^Uz.-EZ.) 
T!l=lZ^ [eeIiZ, 



+ 
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Now by Lemma m 



3A 



i)) 



l + (fc-l) (l-^(Xn-i)) 

~k. 



Using the inequality ^(a^ + 6^) > ab we have that 



= - (^Zi - EZi) + ^EZi - i ^ ^Z,^ - ^ ^ - EZi) 

1 1 \ 

i=l 



<l|z,-i;z,|= + (i + i 



SO by Lemma [6] we have that, 

E 



Z 



< 



k 

k-1 
k 



1=1 

2A 



,i=l 



4A , / 
— + 4A 

k 



and 



E 



Zl-\ Ei=l Zi ) ( Ei=l(^i - 



<- 



ET!i=iZ. 



1' 



^5A 



A; 



Finally since < 



Zi 



< 1 we have that 



ry 1 '7 



< 1 and so 



E- 



Zi - I Z, (^'=i(^. - EZ,)) ^ ^ [EUz. - EZ:, 



YLlZ^ 



< E- 



5A 



ET:=iZi 
1 

k 
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Combining equations Q, Q and (O we have that 



1 13A / 1\ 1 



k - r" kj- 2 k 



and for large enough k, which completes the result. 



Lemma 6 For sufficiently large k if A < 2k log k and if Xn < f then the following all hold 

k-l\^ /k-l\^ 2A / 1 



and for i ^ 1, 



<EZi<[^] (l + _(x„ ^ 



/t-n"^/. 2A / i\\ „„ fk-i^^ 



l--^\x„--\\ <EZi< 



VarZi < ( ] —\Xn ^ 

Proof: 

From equation ([TS] ) we have that 

/A;-l 1 / 1^^^ 



and so since > ^ by Corollary [T] we have that 



k-1-^ 



k 

Then since exp(x) = 1 + x + O(x^) and ^^^^^ (x„ — ^) is small for large k, 

k-l\^ f kA f 1 



k-l\^ / 2A / 1 



^1— j 1,' + -^^"-*; 

which establishes equation Equations Q,® and ^ are established similarly. 
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2.2 Reconstruction 



An upper bound on the number of colours needed for reconstruction is found by estimating the probability 
that the colour of the root is uniquely determined by the colours at the leaves. This method was described 
in |[T9|| and used to a higher level of precision in ||2TI . We restate the result and give a full proof for 
completeness. 

Lemma 7 Suppose that (3 > 1. Then for sufficiently large k if A > k[\og k + log log k + P] then the colour 
of the root is uniquely determined by the colours at the leaves with probability at least 1 — j^^, that is 

mlP{X+{n) = 1) > 1 



log k 



Proof: Let pn be the probabiUty that the leaves at distance n determine the spin at the root, that is pn = 
P(X+(n) = 1). We will show that when k is large then liminf„p„ is close to 1. 

Suppose we fix the colour of the root to be 1 let T denote the sigma-algebra generated by {cr„j : 1 < j < A} 
the colours of the the neighbours of the root. For 2 < i < /c let 6j = #{j : au = i}, the number of times 
each colour appears in the neighbours of the root. Now each colour (T„^ is chosen uniformly from the set 
{2, . . . ,k} so {b2, ■ ■ ■ ,bk) has a multinominal distribution. Let /?>/?*>! and let bi be iid random 
variables distributed as Poisson(L') where D = log k + log log k + /3*. By Lemma|4]we can couple the 6's 
and 6's so that (62, • • • , > (&2i • • • 1 whenever X]i=2 — ^i=2 ~ &a.ch colour 2 < i < k 

there is some vertex uj such that the leaves in L](n) fix the colour of uj to be i then the leaves L^(n + 1) 
fix the colour of p to be 1. Conditional on T the probability that there is such a vertex uj for a given colour 
i is at least 1 — (1 — Pn)^^- Moreover these are conditionally independent of so it follows that 

k 

P 



'n+l>llE[l-{l-pn)''^\J^ 

i=2 
k 

>l[E[i-{i-pr,y 



1=2 

(1 - eM-PnD)f-^ - s 



where s = P(Poisson((A; — l)D) > A) = o{k ^). Now /(x) = (1 — exp(— ^ — s is increasing in 
X and when k is large enough 



1 

> 1 



log k 
and since po = 1, 

which completes the proof. 



inf p„ > 1 - 

n log k 
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2.3 Main Theorem 



Proof: (Theorem [T]) Combining Lemmas |2] and [5] establishes non-reconstruction when A < A;[logfc + 
log log A; + 1 — ln(2) — o(l)]. Lemma |7] shows that the root can be reconstructed correctly with proba- 
bility at least 1 — which establishes reconstruction when A > A; [log k + log log k + 1 — o(l)]. ■ 

Remarks 

For large k the Poisson(A) distribution is concentrated around A with standard deviation 0(\/A) which is 
significantly smaller than the errors bounds in Theorem[T] With some minor modifications the bounds for A- 
ary trees can be extended to Galton-Watson branching processes with offspring distribution Poisson(A). The 
reconstruction of Galton-Watson branching processes with offspring distribution Poisson(A) is of interest 
because, as noted before, it is believed to be related to the clustering transitions for colourings on Erdos- 
Renyi random graphs. 

To be more specific for the proof of non-reconstruction we can again bound Xn = EX~^ where the expected 
value is taken over all possible trees. In Lemma|2]we repeat the same bounds on x„, the only difference being 
A is now random, which does not affect the results for large k. Then similar estimates can be made in Lemma 
|5]provided ^ — -1) is very small. As A is concentrated around it's expected value the probability of 
this not holding is very small and this can be used to complete the proof of non-reconstruction. 
When (3 > (3* > \, with probability going to 1 as /c goes to infinity, the Galton-Watson branching process 
contains a subgraph which is a (A; [log k + log log k + /3*])-ary tree rooted at p. Reconstruction then follows 
from Lemma |7] 
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A Appendix 

Observe that since EX+ + {k - 1)EX- = 1 we have that EX+ - ^ = -{k - 1){EX- - i). We will 
show that the means and variances of the Yij and Zi can all be calculated in terms of Xn and Zn- 

Lemma 8 We have the identities 

EY., = l-^,i^n-l) (10) 



^^H = ^+ ,7, ~A xn-T]-T—rZn. (H) 



1 k-2 / 1\ 1 



For2<i<k, 



and 



k{k-i)\ " kj k-i " 

n 1 A;2-2/c + 2 / 1\ 1 

EY^^ = T^ + ^7^ TT^ h^n - T + T, TT^^n- (13) 
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For any 1 < ii < ^2 < ^. 

Cov{Y,,j,Y,,j)<Q (14) 



Proof: When the root is conditioned to be 1, aj / 1 and so Yij is distributed as X and we have that, 
EY,, = EX- = i - ^[£.Y- - i] = i - ^(.„ - i) 



and 



k 



k ^ 

4 = 1 



^ - E{X+f] 



k-1 



1 / 1\ 1 

where the third equality follows from Lemma [T] For 2 < i < k v/e have that 

and again using Lemma [T] 

i=l 

_ 1 k"^ -2k + 2 / 1\ 1 



Also for 2 < i < A;, 



1 ^ 



i'=2 



so Cov{Yij,Yij) < 0. Finally for 2 < n < ^2 < A; 

k 



Var(l - y^) = XI Var(yi,) + (A: - 1)(A: - 2)Cov(l^,,,-, y,,,) 



i=2 
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and so 

k 

Cov{Y,,j,Yi,j) = Var(l - Yij) - Vai-(y,,) 

i=2 

< Var(X-) - ((A; - 2)Var(X^) + Var(X+)) 

< 

soCow{Yi,j,Yi^j) < 0. 
■ 

Using Lemma [8] we can calculate the means and covariances of the Zj. 
Lemma 9 We have the following results 

,k-l 1 / 1\.A 



For each 2 < i < k then 



and 

k J ' k{k-l)^ V'"" f^J ' (k-l)^ 



, A:2-4/c + 2/ 1\ 1 \^ 
EZf=\{ —— + — x„ - + zn (18) 



for any \ <ii < i2 <k, 

Cov{Z,,j,Zi,j)<0 (19) 

Proof: By equation (flOl ) we have that 

A 

i^zi=i?n(i->^ii) 

1 / 1^^^ 



+ 1 T X 



k k-1 \ k^ 

which establish equation ([T5] ). Equations (fT6l) . (fTTl) and (fTSl ) follow similarly. Using equation ([14)) we have 
that for 1 < ii < i2 < k, 

A 

EZ,,Z,, = Ellil -Y,,j){l - Yi,,) 
i=i 

A 



EZi-^EZi^ 



which establishes equation ( [T9] ). 
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